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ABSTRACT 

We present a galaxy catalog simulator which turns N-body simulations with Friends-of-Friends halo 
catalogs and IsoDen subhalos into mock, multiband photometric catalogs. The simulator assigns 
galaxy properties to each subhalo in a way that reproduces the observed cluster galaxy halo occupa- 
tion distribution, the radial and mass dependent variation in fractions of blue galaxies, the luminosity 
functions in the cluster and the field, and the color-magnitude relation in clusters. Moreover, the 
evolution of these parameters is tuned to match existing observational constraints. Field galaxies are 
sampled from existing multiband photometric surveys of similar depth using derived galaxy photo- 
metric redshifts. Parametrizing an ensemble of cluster galaxy properties enables us to create mock 
catalogs with variations in those properties, which in turn allows us to quantify the sensitivity of 
cluster finding to current observational uncertainties in these properties. 

We present an application of the catalog simulator to characterize the selection function of a galaxy 
cluster finder that utilizes the cluster red-sequence together with galaxy clustering on the sky. Com- 
pleteness varies with mass and redshift, and contamination roughly constant at around 35% out to 
z r*> 0.6. We estimate systematic uncertainties due to the observational uncertainties on our simulator 
parameters in determining the selection function using five different sets of modified catalogs. Our 
estimates indicate that these uncertainties are at the < 15% level with current observational con- 
straints on cluster galaxy populations and their evolution. In addition, we examine the utility of the 
B gc parameter as an optical mass indicator and measure the intrinsic scatter of the _B gc -mass relation 
to be approximately log normal with ct\ o&10 m ~ 0.25. Finally, we present tests of a red sequence 
overdensity redshift estimator using both simulated and real data, showing that it delivers redshifts 
for massive clusters with ~2% accuracy out to redshifts z ~ 0.5 with SDSS-like datasets. 



1. INTRODUCTION 

Connecting baryonic cosmic structures with dark mat- 
ter in the universe is very important in understand- 
ing the evolution of large scale structure. This is 
the key to understanding the implications of large ob- 
servational surveys within the context of theoretical 
and numerical studies of large-scale structure evolu- 
tion. Within this field of study clusters of galax- 
ies have long been recognized as importa nt laborato- 
ries for, e.g., studies of galaxy evolution (jFaber et al.l 
l2007t lHansen et al.1 120091) . int racluster medium enrich- 
ment (e.g.. iLin fc Mohrll2004 more references therein) 
and critical sign posts for studies of the evolution of 
the large scale structure and cosmolo gy through proper- 
ties such as their mass function (e.g. iWhite et all Il993t 
iVikhlinin et al.ll2009b iVanderlinde et al 1120101) and their 
clustering (e.g. Miller et alJl2001t iHutsilboiOj ). 

Galaxy cluster surveys are becoming one of the 
key tools for unveiling the nature of the dark energy 
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(IWang fe SteinhardUlT99 8; Hai man et aTll200lD . For ex- 
ample. Idadders et al.l f|2007T ) recently used a moder- 
ate scale galaxy cluster survey to constrain cos mological 
parameters using the self-calibration method (|Hu et al.l 
[20031: iMaiumdar fc Mohrl [20031 EOfll Other analyses 
using Sloan Digital Sky Survey (SPSS lYoTk~et^[200l 
have used a large ensemble o f nearby clusters to study 
cosmology (jRozo et al.l 120101 ) . Although these optical 
surveys delivered large numbers of clusters, the cosmo- 
logical constraints were only moderately strong. This 
points towards the need to better understand the sys- 
tematics of cluster surveys. The key areas of con- 
cern include the cluster selection and the cluster mass 
estimation. Without continued progress in these ar- 
eas we will not be able to capitalize on the rich cos- 
mological information emerging from, for example, the 
South Pole Telescope Su nyaev-Zel'dovich effect survey 
(jStaniszewski et all 120081 ). t he Dark Energy Sur vey and 
the eROSITA X-ray survey (jPredehl et al.l l20Toh . 

Many simulations have successfully reproduced obser- 
vational features in clusters and galaxies, such as the 
luminosity function or the correlation function. This 
usually has been done in one of two ways: using the 
framework of halo occupa t ion distribution (HOD ) (e.g., 
IBerlind fc Weinberg! [2001 IKravtsov et~aTI 12004) which 
can also depend on luminosity (e.g.. I Yang et al.l 120031 ) 
and by using a semi-analytical mode l for galaxy evo- 
lution in merger history trees (e.g., iKauffman n et al.l 
1999). In the HOD framework, galaxies are populated 
within halos assuming a certain HOD and described by 



2 



Song et al. 



a conditional probability P (N \ M) that a halo wit h 
mass M contains N galaxies (Berlind & Weinberg 2002). 
I Yang et"aLl ()2003l ) extended the HOD modeling by label- 
ing the galaxies with luminosities that use the conditional 
luminosity function (CLF) P(N | M)dL, which gives the 
probability of finding a galaxy with luminosity in the 
range of L ± dL/2 as a function of halo mass M. These 
methods are powerful tools to populate dark matter par- 
ticle simulations with galaxies, but one must rely on the 
adopted shape of the HOD or a specific galaxy evolution 
model. Certain aspects of galaxy evolution models, in 
particular, are still quite uncertain. 

A more recent scheme to c reate mock catalogs uses the 
Hubble Volume Simulation (Ev rard et al. 2002 J], where 
dark matter particles were chosen to be galaxies based on 
their local density, regardless of the host halo mass, and 
luminosities were assigne d by the observ ed luminosity- 
density relation in SDSS (jWechslerl [2004D . Another re- 
cent exampl e of relating luminositi es and galaxies was 
developed bv lVale fc Ostrikeil (|2006l ) using a subhalo cat- 
alog. In that study, they introduced a non-parametric 
model to relate the luminosity of galaxies and the mass 
of the dark matter halo or subhalo which hosts it, un- 
der the assumption that the luminosity-mass relation is 
monotonic. Their scheme, however , did not reproduce 
the observed LF in lLin et all (|2004| ). 

Our objective is to provide simulated galaxy catalogs 
that reproduce many of the properties of the large scale 
structures in the universe, so that we can use these cat- 
alogs to analyze and calibrate various automated opti- 
cal and NIR galaxy cluster analysis tools. Moreover, we 
want our catalog simulator to allow us to turn the current 
observational uncertainties into uncertainties on the se- 
lection function and our ability to estimate masses in the 
optical. To this end we assign the subhalo catalogs from 
the N-body simulation to be galaxies, and then directly 
implement observational features of galaxies in various 
wavebands without assuming any HOD, semi-analytical 
model, or luminosity-mass relation. 

This paper is organized as follows. In fj2j we describe 
the N-body simulation, including the creation of our sub- 
halo catalog. In £j3j we introduce our method for as- 
signing galaxy properties to subhalos. In SJSJ a clus- 
ter finder is tested against the mock catalogs and an- 
alyzed. In fj6] we discuss the reliability of the optical 
mass indicator, the B gc parameter. Finally in [J3J ad- 
vantages and limitations of our approach are discussed, 
as well as future directions of the project. Throughout 
the paper, we assume the cosmological parameters to be 
£Im = 0.3, — 0.7, and the Hubble parameter to be 
H Q = 70 km s" 1 Mpc" 1 . 

2. N-BODY SIMULATION 

The catalog simulator uses subhalos from an N-body 
simulation to determine the locations and dynamics of 
galaxies. Below we describe the simulation and extrac- 
tion of the halo and subhalo catalogs. 

2.1. Halo Catalogs 

We carried out a cold dark matter (DM) only structure 
formation simulation with the Hashe d Oct-Tree (HOT) 
algorithm, initially described in iWarren fc Salmon! 

7 http:/ /www. physics. lsa.umich.edu/hubblc-volumc 



(1993), running at Los Alamos National Laboratory. We 
modeled a universe based on the concordance cosmol- 
ogy (0 M = 0.3,17a = 0.7, H Q = 70 km s' 1 Mpc" 1 ). 
Initial conditions were der ived from transfer function s 
calculated by CMBFAST (|Seliak fc Zaldarriagal [1991 . 
This DM simulation was performed with the same code 
and paramete r s as t he series of simulations described in 
IWarren et al.l (|2006() . The simulation followed a cubic 
computational volume 816.3ft _1 Mpc across with 1280 3 
(2.1 billion) particles. Each particle had a mass of 
3.63 x 10 9 M Q . 

DM ha los are identified b y the Friends-of- Friends (FoF) 
method (|Davis et al.lll985l ). The FoF method identifies 
a set of spatially associated particles, roughly contained 
within an isodensity surface with a value of 

(h \ - aM P 

where M p is the particle mass and a is a constant of order 
2. For each object, one can identify how many linked 
particles it contains. Here the adopted linking length, 
hunk, is 0.2 times the average inter-particle spacing. 

2.2. Subhalo Catalogs 

Subhalos are identified independently from halos. The 
subhalo finding algo rithm, called the IsoDen method 
iPfitzner et al.l (|1997l) . selects all density enhancements. 
Subhalos are traced to the smallest clumps which contain 
at least 10 particles, corresponding to a subhalo mass of 
3.6 x 1O 1O M0. The IsoDen method calculates the spatial 
density field of particles to identify local peaks as centers 
of subhalos. At each center, iso-density surfaces grow to 
find all particles that belong to the center until any two 
different surfaces start to touch each other. 

Subhalos are then paired up with host halos if they lie 
within the virial radius r2oo of the host halo, within which 
the mean density is 200 times higher than the mean den- 
sity of the universe. Since the simulation output box 
is periodic, subhalos on the edges of the box are also 
checked for their membership with hosts on the other 
side of the box. When a subhalo matches with two dif- 
ferent host halos, we choose the more massive halo as the 
host. Although FoF masses (that are given for the host 
halos) a re not exactly sam e as spherical overdensity (SO) 
masses. fLukic et ail (|2009T ) found that the agreement be- 
tween the two masses is good (within 5%) for most halos 
(~85%) in their N-body simulations. The other 20% of 
the halos had clear substructures and non-relaxed ha- 
los. We assume that for all of our halos, the agreement 
between FoF mass and SO mass is good enough for our 
purpose. We note that, however, this is one of the sources 
of scatter in the following comparison. 

We examine the matched subhalo catalog to see 
whether the basic properties such as the subhalo mass 
functions (SMF) and the radial distributions are consis- 
tent with observations of cluster galaxies. For example, 
to compare the mass function of the field population with 

that of cluster members, we scale the field SMF by ^22. 

. . . M 

because the density in cluster DM is by definition en- 
hanced by 200 times compared to the field within the 
virial region. Here we take clusters to be those halos 
with mass greater than 5 x 1O 13 M that contain mem- 
ber subhalos that were matched with the halos within 
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^200 (i-e., no galaxy-free clusters are allowed). The field 
population includes lower mass halos along with their 
member subhalos (mass less than the above mass cut), 
as well as subhalos that were not matched to any halo. 
In Figure [TJ filled circles represent the field population 
and stars represent cluster members. For this plot, the 
subhalos were not cut off by their definition in mass (i.e., 
10 particles or more or mass greater than 3.6 x 1O 1O M . 

One characteristic to note is that there is a cutoff on 
the low mass end for subhalos in the field and in clusters 
that is driven by the mass resolution of the simulation, as 
denoted by the dotted line. Note that the field SMF ex- 
tends to roughly three times lower mass than the cluster 
SMF. For the purposes of catalog simulation, we choose 
the mass threshold of subhalos at 3 x 10 10 Mq, which 
allows us to use the bulk of the available population of 
cluster subhalos. The difference between field and cluster 
SMF turnover is an indication that subhalos must have 
been destroyed in dense regions, and particle groups of 
lower mass than this threshold are not found in sufficient 
numbers as subhalos. 

Also in Figure [TJ we find an overdensity of cluster sub- 
halos as compared to field subhalos after scaling by a 
factor of In the if s -band, the normalization offset 
between the cluster luminosity fu nction (ILin et al.| | 2004|) 
and the field luminosity function (jKochanek et alj|2001j) 
is about ~3.5, which is consistent with the offset at a 
mass range around 5 x 10 11 Mq in the subhalo mass func- 
tion if we assume differences in K s band mass to light 
bet ween the field and the cluster are not significant. Note 
that iGao et all (|2004[ ) find an offset in the subhalo abun- 
dance distribution in massive halos as compared to that 
in th e Universe as a, who le. Our result is also consistent 
with iGiocoli et al.l (|2008|) , who report that the normal- 
ization of the unevolved subhalo mass function is higher 
than the SMF in the host halos by a factor of two. 
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Fig. 1. — Subhalo mass function: We plot the number density of 
subhalos per log 10 M200 as a function of subhalo mass for the field 
subhalos (dots), scaled by and the cluster subhalos (stars). 

We define all halos with masses M200 > 5 X 1O 13 M0 as clusters. 

In Figure [2] we show subhalo radial profi les compared 
with an NFW profile (jNavarro et al.l 11997ft . A total of 



12,000 halos are stacked to produce this figure and Fig- 
ure [JJ For the radial profile in Figure [2] these halos are 
then separated into three mass bins, where only first 
3,000 halos are shown. The lowest mass bin, shown 
with the dashed line, represents halos with mass between 
5 x 1O 13 M and 2.0 x 10 14 M Q , and the next mass bin, 
shown with the dash-dot line, contains halos with mass 
between 2.0 x 10 14 and 1O 15 M and the highest mass 
bin contains all remaining halos that reach a mass up to 
3.5 x 1O 15 M . We compare these radial profiles of the 
subhalos with the NFW profile (solid lines) , with a con- 
centration of 5 and 3. These three subhalo profiles are in 
reasonable agreement with the NFW profile, but the ha- 
los in the highest mass bin, represented with the dash line 
in the figure, show a deficit of subhalos in the central re- 
gion when it's compared to the NFW profile with concen- 
tration of 5 (for DM particles). T his deficit has already 
been noted in previ ous studies (e.g-. IDe Lucia et al]|2004t 
IGhigna et al.ll200GT) and is referred to as 'antibiased' rel- 
ative to the dark ma t ter in the central regions of the 
halos. iDe Lucia et al.l (|2004ft suggested that this is natu- 
rally explained as a consequence of the orbital decay com- 
bined with dynamical friction and mass loss that subha- 
los experience in high density regions. Subhalos that are 
on orbits that take them through the cluster center are 
quickly destroyed and soon are no longer distinguishable 
from the central halo. Also, observed galaxy radial pro- 
files in clusters are generally well fit by an NF W model, 
but with a lower concentration of 3 or 4 (e.g. ILin et al\ 
12001 12001 . In our catalog generator these DM subhalos 
are used as galaxies, and so any radial properties of the 
subhalos will be reflected in the resulting galaxy popu- 
lation. For testing cluster finders that do not assume 
a specific shape for the cluster galaxy profile, this mis- 
match between the radial profiles of the cluster galaxies 
and the subhalos is not crucial. 
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Fig. 2. — Subhalo radial profiles: We plot the density of subhalos 
as a function of distance r/r200 from the center of their parent halo. 
The profiles have been normalized by the total number of subhalos 
within each host halo's virial radius T200- Halos in the mass range 
of [5e+13,2e+14], [2e+14,8e+14],[le+15,3.5e+15] in units of M© 
are shown with cyan dash-dot-dot, magenta dash dot, and black 
dash lines, respectively. For comparison we plot an NFW model 
with a concentration c = 5 (the upper blue solid line) and c = 3 
(the lower blue solid line) for the DM particle distribution. 
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3. CATALOG SIMULATOR 

The catalog simulator works on the halo and subhalo 
catalogs described in the previous section. A 2-D sky 
view from an observation is a projection of a lightcone 
where the volume depends on redshift extent, cosmologi- 
cal parameters and the angular size or field-of-view. Ide- 
ally, one would like to build a mock catalog from a large 
enough simulation to include self-consistently the cosmic 
expansion and structure formation evolution along the 
lightcone. In the absence of such a simulated light cone, 
one can use one or more outputs from a periodic simu- 
lation box. In the tests described in the sections below 
we use a single z = simulation output and stack it as 
needed to simulate the more distant universe. Therefore, 
the simulations used in this paper do not naturally follow 
the evolution of the underlying dark matter large scale 
structure. 

To build our lightcone we set an observer at a ran- 
dom position inside the box with a random line-of-sight 
direction - additional boxes are located along the cho- 
sen line-of-sight to simulate the higher redshift universe. 
This random realization makes it possible to build dif- 
ferent mock catalogs from the same simulation. Cluster 
and galaxy redshifts are assigned using their comoving 
distances from the observer and their peculiar velocities. 
The angular diameter distance together with the field of 
view determines the portion of the simulation that is in- 
cluded in the simulated catalog at each redshift. Once 
redshifts are determined, galaxy properties are assigned 
to reproduce several observed quantities, including the 
luminosity function, the color distribution and the halo 
occupation number (HON). In the following sections, we 
describe how observational constraints are used to guide 
the catalog simulator. 

3.1. Galaxy Luminosity 

In principle we could use the subhalo mass to assign 
a luminosity. However, the mapping from dark matter 
mass to stellar mass (and therefore luminosity) is com- 
plex, because of the stripping processes that affect the 
DM halos and stellar populations differentially and that 
depend on the orbits of the galaxies through the clus- 
ter. Therefore, rather than do this, we assign galaxy 
luminosity randomly to each subhalo in a manner that 
reproduces the observed luminosity function in the ap- 
propriate environment. We treat the BCG separately. 
Because there is no strong mass segregation of subha- 
los, these two approaches should produce similar results. 
Moreover, with our approach we parametrize the field 
and cluster galaxy populations using Schechter luminos- 
ity functions, and we can vary those populations system- 
atically to explore the sensitivity of cluster finding and 
mass estimation to these properties. In addition, with 
this approach we have the flexibility to change the num- 
ber density of field galaxies relative to cluster galaxies by 
altering the luminosity parameters. 

When assigning luminosities to galaxies, we follow the 
observed i^ s -band LF as giv en bvlKochanek et al.l (|2001[ ) 
for the field population and ILin et al.l (|2004[ ) for cluster 
members (for parameter values see Table [TJ. Both stud- 
ies found a Schechter function as a good fit with the faint 
end slope fixed for local samples. Within the observed 
uncertainties, we match the two LFs by adopting with 



the corresponding faint end slope, a (fixed). Luminosi- 
ties of BCGs are deter mined using the ob served relation 
between Lbcg — 

Af9nn (ILin fc M ohr 2004|); 

T / !U \ 0-26±0.04 

Lbc % =4.9±0.2( M T ) (1) 

We impose a flux limit in drawing random luminos- 
ity from a LF that corresponds to the limit where the 
number of subhalos would match the expected number 
of galaxies in the luminosity function. This flux limit is 
determined differently for the clusters and the field. For 
the clusters, we compare the HON in the simulated clus- 
ters (the number of subhalos wit hin each h o st ha lo) to 
the HON in real clusters found by ILin et all (|2004D ; 

/ u \ 0.87±0.04 

where -ZV200 is the number of galaxies within i?2oo with 
an absolute K-band magnitude of -21.0, approximately 
Af^+2.5. This comparison provides a corresponding flux- 
limit. 
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Fig. 3. — A plot of how deep one can go down in simulating a 
survey, depending on underlying subhalo populations. Black x-, 
y- axes with the black solid line (clusters) and red dot-dashed line 
(field) show how different lower mass threshold of subhalos to be 
included in a mock catalog determines K s - limit by changing sub- 
halo number density in the simulation. The cluster and field lines 
are essentially the subhalo mass function curves in the absolute K s 
magnitude space. Blue axes with blue solid line show a SDSS-like 
survey depth in r- band at 22.5 corresponds to which absolute mag- 
nitude in K s - at different redshift. With subhalo lower mass limit 
of 3.0 X 10 10 Mq indicated by the green dashed line, the survey 
limit in X-band determined by subhalo density in clusters is deep 
enough at redshift greater than ~0.2, as the green line intercepts 
with cluster and field mass function curves. 

For the field population, we find the survey depth 
TABLE l 

2MASS LF PARAMETERS 







a 


Cluster^) 


-24.34 ±0.01 


-1.1 


Fieldt 2 ) 


-24.16 ±0.05 


-1.1 



< ^Lin et al] l2004l) ; ^Kochanek et al.l 120011') . 
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from the total number density of subhalos above a cer- 
tain mass cut (i.e., 3 x 1O 1O M0) in the simulation box 
by comparing it to the number density of all galaxies 
(by integrating t he LF to the survey depth) found by 
IKochanek et al.l (|2001l ). The resolution of this subhalo 
catalog is sufficient to push this survey depth down to 
-17 or -18, deep enough for a SDSS-like survey for all 
but the very lowest redshifts (Figure [3]), as the intercept- 
ing magnitude in absolute K s -band (~-17.5) between the 
adopted subhalo mass cut (3.0 x 10 10 M© shown with the 
green line) and the field subhalo number density in abso- 
lute K s space is deeper than a SDSS-like survey limit in 
absolute K s at redshift of 0.2 and beyond. The vertical 
flat region is due to the subhalo mass limit in the current 
simulation that we are using (i.e., there are no subhalos 
with mass less than 1.0 x 10 10 M Q in clusters). 

In determining the flux limit we evolve the of both 
field and cluster LFs with redshift according to the BC 
model (see H3.3[) . This is equivalent to an assumption 
that galaxies in both clusters and the field are red 
galaxies (i.e., no recent star formation), which is rea- 
sonable because if-band light is less affected by recent 
episode of star formation. We explore the effects of differ- 
ent field versus cluster population evolution in Section l3.5l 
below. 

In the process describe d above we also u se the observed 
HON redshift evolution. iLin et all (|2006f ) examined the 
N-M relation evolution by combining their local sample 
with a sample of several dozen clusters extending to red- 
shifts z ~ 1. Modeling the evolution as 

N(M,z) = N(M,z = 0)(1 + z) lHOD . (3) 

they showed that the data currently suggest very weak 
evolution of the HON. We adopt this form for the HON 
evolution with a fiducial "{hod — and vary this pa- 
rameter to explore the impact of variations in the HON 
evolution on cluster finding. 

3.2. Galaxy Type 

Recent studies (jWeinmann et a l. 2006) have measured 
the fraction of blue galaxies within groups and clusters as 
a function of halo mass, halocentric radius, and luminos- 
ity using a group and cluster catalog that was optically 
selected from SPSS PR 2 in th e redshift range between 
0.01 and 0.2 (|Yang et all 120051 ). Results indicate that 
the blue fraction is hi gher in the outskirt s of halos and 
in less massive halos. IGerke et al.l (|2007l ) extended the 
measurement of the blue fraction to higher redshift and 
to the field. They measured the cluster blue fraction as a 
function of radius from the cluster center, showing that 
the blue fraction gradually increased with distance from 
the center, approaching the blue fraction in the field. 

This provides a convenient and simple way to model 
the galaxy populations in our simulated catalogs. We 
adopt an approach where clusters and the field contain 
only two types of galaxies - blue and red, with blue being 
star-forming galaxies and red being passively evolving 
galaxies. The variations in color of each of these types 
of galaxies are described in the next section. We further 
assume that clusters contain a special type of red galaxy 
- a brightest cluster galaxy (BCG)- which is the most 
luminous galaxy that is typically located near the center 
of the cluster. 



We construct a functional form for the blue fraction 
to include variation with halo mass M 2 oo, distance from 
the halo center , and galaxy absolute magnitude in 
-ftT s -band Mk, by assuming variations in blue fraction 
associated with these three parameters are separable: 

fbiue(z) = A(M 200 )f b (—)f b (M Ka )(l + (4) 

7*200 

where A(M2oo) is a parameter associated with halo mass, 
fb(-r— ) is the blue fraction as a function of — — , fb(Mu) 

J V ^200 ' ^200 J V ' 

is the blue fraction as a function of its K s magnitude 
and the redshift evolution is assumed to be a power law 
with index jbiue- The adopted blue fraction as a func- 
tion of luminosity is measured in r-band, but because 
the primary band in our scheme is K s , we assume an 
arbitrary fixed color r-K s of 4.0 in using the published 
results to calibrate our model. To find the parameter 
A{M20o)i we integrate the blue fractions over the radial 
profile and the luminosity function within the virial re- 
gion. Once A(Af2oo) is determined for halo masses from 
log 10 M2oo = 11-85 to log 10 M2oo = 15.55, each galaxy 
is assigned a probability of being blue using Equation [I] 
together with the halo mass, the subhalo position within 
the halo and the assigned K s band luminosity. Beyond 
the virial radius we have a field population, which is 
taken to have a blue fractio n of 80 %. 

Since IButcher fe Oemlerl (|1984f ) presented evidence 
that the fraction of the blue galaxies in clusters in- 
creases with redshift (the BO effect), many studies 
have attempted to replicate thei r work through different 
star-formation indica tors (e.g.. iKodama fc Bowerl [20011 : 
iPoggianti et~aT1 12006ft . There are, however, many com- 
plications involved in attempting to define the evolu tion 
of the fraction of blue galaxies fsee lGerke et al.ll2007l and 
references therein). A recent study on the evolution of 
the b lue fraction in groups and in the field (|Gerke et all 
120071 ). measured the blue fraction in groups as a function 
of redshift between 0.75 and 1.3. They found a nearly 
constant blue fraction in groups in this redshift range, 
and a rising field blue fraction beyond 2 = 1. Since 
there is no single self-consistent observational study on 
the evolution of blue fractions in clusters at different red- 
shifts ranging from local to the very distant universe, we 
parametrize the blue fraction evolution using "/blue shown 
above, which allows us to explore the impact of more or 
less rapid evolution in the galaxy population. 

3.3. Galaxy Color 

Once every galaxy is assigned a if s -band magnitude 
and an identity as either a blue or red galaxy (note that 
BCGs are modeled as red galaxies), colors in optical and 
near-IR bands follow. Red galaxies have a characteris- 
tic spectrum with no strong emission lines and a promi- 
nent 4000 A break since their stars have been passively 
evolving since their last episode of star formation at high 
redshift. We use a Bru zual & Chariot model (BC03; 
IBruzual fc Charlotll200l) to assign the colors for the red 
galaxy population. We assume a single burst of star for- 
mation at z — 3 and let the galaxies passively evolve 
to z = according to a single stellar population (SSP) 
synthesis model. We use six different models with six 
distinct metallicities corresponding to different luminosi- 
ties, allowing us to construct a tilted red-sequence. These 
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SSP models are then tuned in metallicity so that the 
tilt of the color-magnitude relation for the Coma cluster 
(z = 0.0234) is reproduced. The models provide appar- 
ent magnitudes in ugrizJHK s bands at specific redshifts 
out to z = 2.98 and at six different luminosities at each 
redshift. We interpolate using these models to generate 
model colors at intermediate redshifts and luminosities. 
Since each of the models contain only a single popu- 
lation of stars, we introduce scatter in the metallicity- 
luminosity relation to model variations among galaxies 
(i.e., to introduce intrinsic scatter in the color- magnitude 
relat ion). The observational ly motivated scatter is 0.075 
mag iBarkhouse et al.l (|2006l) . 

The fact that blue galaxies with current or recent star 
formation activity are have in general complex star for- 
mation histories (e.g., O'Connell et al. 1997) makes them 
more difficult to model. Therefore, we sample colors of 
blue galaxies directly from a subset of the FLAMEX0 
database. This database contains photometry in the 
BRIJK S bands, as well as for the Spitzer IRAC [3.6] [4.5] 
[5.8] [8] micron filters, and measured photometric red- 
shifts of 175,000 galaxies. We choose only those galaxies 
with well measured fluxes in NIR bands, leaving us with 
about 36,000 galaxies. To couple a galaxy with represen- 
tative colors, we divide the adopted subset of FLAMEX 
into further subsets according to redshift and iv^-band 
luminosity. For each simulated galaxy, we choose the 
subset which is closest to its redshift and -ftf s -band lu- 
minosity, and then randomly select an observed galaxy 
from that group, with the additional constraint that it 
be bluer than the corresponding halo's color-magnitude 
relation at that redshift. 

3.4. Observational Effects 

Once magnitudes are determined for each galaxy, ob- 
servational uncertainties are applied to this model uni- 
verse to produce a realistic simulated catalog. The 
galaxy catalog simulator can choose a specific telescope 
or instrument with a specific survey depth to add noise 
to the galaxy magnitudes. An integration time is deter- 
mined such that the corresponding signal-to-noise ratio, 
which includes both systematic and statistical uncertain- 
ties, reproduces the measured uncertainties for the selec- 
tion of telescope or instrument. The S/N is given by, 

9 rr 2 

_ sre 

N \J a lrc + a bkg + a sys 

where cr src , a^kg are uncertainties in measured flux of 
a source and sky brightness (as a background), respec- 
tively, while o~ S y S is the systematic uncertainty, which 
is band-dependent for a specific instrument. The cho- 
sen survey depth will impose a magnitude cut on galax- 
ies and typically our final catalogs contain galaxies with 
S/N > 5 (i.e., a mag < 0.2). 

3.5. Variations in Galaxy Populations 

One of the advantages an empirical catalog simulator 
like ours is that any input parameter, such as the adopted 

8 FLAMINGOS Extragalactic Survey is a deep imaging survey 
covering 7.1 deg 2 sky in the J and K s niters. The purpose of this 
survey is to s tudy galaxy and galaxy cluster evolution at 1 < z < 2 
HElston et al.|[2o6g'l 



LF for clusters and the field or the evolution in the HON 
and the galaxy color distribution, can be changed to cre- 
ate a different mock catalog within the same simulation 
output. Changing one parameter at a time to produce 
another mock catalog can be done, and comparing the re- 
sults from the primary run to those from other runs with 
modifications allows us to test how much the parame- 
ter changes affect the catalog and the followon cluster 
finding or mass estimation. This provides a straightfor- 
ward way to examine the systematic uncertainty of the 
selection function for the cluster finder due to the uncer- 
tainties in the creation of the mock catalogs. The gen- 
eral trend of the modifications is physically motivated, 
but the exact values for the deviations are not observed 
quantities. In our model, there are five parameters that 
can be modified: the blue fractions as a function of phys- 
ical properties of host clusters and galaxies, the evolution 
of the fraction of red galaxies within clusters with red- 
shift, the luminosity function of the field population, the 
evolution of HON with redshift, and the tightness of the 
color-magnitude relation. 

4. AN SDSS-LIKE SIMULATED CATALOG 

We use the simulator to produce an SDSS-like galaxy 
catalog. In the final galaxy catalog, we include positions, 
redshifts, peculiar velocities, photometric redshifts, mag- 
nitudes in ugriz in the AB system, JHK S magnitudes 
and the [3.6] and [4.5] micron Spitzer IRAC bands in 
the Vega magnitude system. Table [2] shows the input 
parameter values for the primary run. 

The resulting galaxy catalog is then subjected to a se- 
ries of validation tests which include logN-logS surface 
density for each band, color-magnitude relations for clus- 
ters, galaxy luminosity functions of clusters, halo occu- 
pation numbers, radial profiles, and blue fraction as a 
function of radius and mass. Some of the basic features 
of this catalog appear in Figure |U The top figure shows 
the logN-logS for both galaxies in the simulated cata- 
log (solid lines) and those for ~ 12deg 2 of the sky from 
the SDSS archive (dash lines). The bright end for the 
SDSS galaxies is noisy due to the small sky area. It is 
clear that the two distributions are in good agreement at 
magnitudes brighter than the flux limit of the simulation 
(at r~ 22). The bottom figure shows the redshift distri- 
butions for the catalog compared to SDSS. The solid lines 
show the distribution of assigned photometric redshifts 
in the simulation, while the two dashed line shows the 
measured photometric redshift distribut ion that is avail- 
able in public SDSS DR7 data archive (|Abazaiian et al.l 
2009) . Both solid lines and dashed lines are divided into 
two sets according to galaxies' magnitude. The black 
lines toward the lower redshift is for brighter galaxies 
with r magnitude cut at 20 or brighter, while the red 



TABLE 2 
SDSS-Like Catalog Parameters 



Property 


Parameter 


Value 


Field Luminosity Function 


K* 


-24.21 


Cluster Luminosity Function 


K* 


-24.33 


Blue Fraction Evolution 


Iblue 


0.00 


HON Evolution 


1HOD 


0.00 


Red Sequence scatter 




0.07 
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Fig. 4. — Comparison in galaxy counts and redshift distribution: 
Top: Solid lines represent galaxy counts for the mock, with blue 
for it-, green for g-, yellow for r-, red for i- and black for z- bands. 
Dashed lines exhibit the same distribution for SDSS. Bottom: The 
redshift distribution of the mock (solid lines) and the measured 
photometric redshift distribution in SDSS DR7 (dashed line). The 
black lines arc galaxies brighter than 20 mag in r-band and the red 
lines are galaxies fainter than 20 mag in r-band. 

lines toward the higher redshift is for galaxies fainter than 
r of 20. The general trend between the solid lines and 
the dashed lines agree with each other, but the SDSS 
galaxies show a somewhat smoother distribution. We 
note that the depth of the two datasets are not identical, 
although this particular version of the mock catalog is 
generated to simulate an SDSS-like survey. In the mock 
catalog, galaxies are cut off at the magnitude of 22.5 in 
r-band, while the SDSS galaxies used in this analysis 
are not. Fainter galaxies must have larger photomet- 
ric uncertainties which would make photometric redshift 
measurement less reliable. 

Figure [5] shows an example of the color magnitude dis- 
tribution of galaxies toward a cluster, which has mass of 
4.5 x 10 14 M Q and a redshift of 0.21. The dotted line rep- 
resents the red-sequence at redshift of 0.2 by the same 
galaxy models used in the galaxy simulator. In order 
to test how well a simulated red-sequence represents a 
real cluster's red-sequence at a given redshift, we con- 
struct a redshift estimator based on the red-sequence of 



-I 



16 



20 



22 



Fig. 5. — Example of color-magnitude diagram. This contains 
a cluster with mass of 4.5 X 10 l4 M Q at redshift 0.21. Red dots 
are the cluster members. The model red-sequence line at z=0.2 is 
shown. 

clusters, which in principle is co mparable to the redshift 
estimator of the RCS group (see lGladders fc Yeel ([2005) 
for details), and measure redshifts of clusters based on 
the simulated catalog. We use the same red galaxy mod- 
els by BC03 that we use in the galaxy simulator described 
in ^3.31 The detail of the tests of the redshift estimator 
is described in fj7l In the tests, we measure compara- 
ble scatter in redshift estimates for simulated clusters as 
compared to our ensemble of clusters based on SDSS- 
Chandra joint dataset below z — 0.6. This demonstrates 
that the simulated galaxy colors are consistent with the 
real galaxy colors in clusters with z < 0.6. 

5. EVALUATING A RED SEQUENCE CLUSTER FINDER 

In this section, we test a cluster finder - the Voronoi 
Tessell ation and Percolation ( VTP) cluster finding algo- 
rithm (|Barkhouse et all 1200611 using the simulated cata- 
logs described above. This cluster finder code is still un- 
dergoing development and testing to optimize the cluster 
detection parameters (Barkhouse et al. 2011, in prepa- 
ration). In the VTP algorithm, clusters are detected as 
spatial overde nsities by Voronoi Te ssellation and Percola- 
tion method ()Ramella et al.|[2001[ ) within redshift shells 
defined using the expected cluster galaxy colors in the 
color-magnitude relation. Because the finder searches in 
color-magnitude space, its output includes the redshift 
estimations as well as positions and richness measure- 
ments. The VTP finder has been run on the SDSS DR6 
to build a cluster galaxy catalog where these test results 
are utilized. Completeness and false-positive (contami- 
nation) tests show how effectively this finder detects clus- 
ters in the simulation. 'Halos' refer to the dark matter 
halos in the simulation, while 'clusters' refer to the sys- 
tems that are recovered by the cluster finder. 

The VTP algorithm first filters the galaxy catalog 
to select galaxies within 3a of the corresponding red- 
sequence at each redshift. The redshift-dependent red- 
sequence location in color-magnitude space is determined 
by assuming a stellar synthesis model with a star forma- 
tion ep och with a burst at z = 5 prior to passive evo- 
lution ( Koda ma fc Arimotol 119971 ). Redshifts from the 
finder, therefore, can be biased by the assumed model. 
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In order to correct this bias, the raw prediction on red- 
shift by the VTP finder is calibrated with redshifts in the 
mock catalog, since the mock catalogs use an SSP model 
with a different star burst formation time (z = 3). 

5.1. Completeness s(M2oo, z) 

We analyze the VTP cluster selection results to mea- 
sure the completeness and contamination of the resulting 
cluster catalog. Note that completeness and contamina- 
tion measures are sensitive to some degree to how the 
matching of the cluster and halo catalogs is done. In 
this study, we match the VTP cluster candidates and 
the true dark matter halos by drawing a boundary for 
each dark matter halo with its r2oo and then selecting all 
VTP clusters that lie within this boundary. In cases of 
multiple cluster-halo matches, we select the more mas- 
sive halo as the true match. The completeness s(M2oo, z) 
is defined as the fraction of detected halos of that mass 
and redshift out of all the halos in the mock catalog with 
that mass and redshift. 

Figure [B] shows the completeness of the primary run. 
Note that the completeness is a strong function of mass 
with essentially a cutoff at some threshold mass that in- 
creases with redshift. The completeness in the lowest 
redshift bins (< z >~ 0.06) shows poorer performance 
than that in the next two bins (< z >~ 0.17 and 0.29), 
and this may be because these nearest clusters of galax- 
ies are projected onto a much larger portion of the sky as 
compared to high redshift systems, effectively decreasing 
the signal to noise of a given system. Note also that at 
these redshifts the limited number of subhalos does not 
allow us to populate the cluster and field populations to 
the full depth of the flux limited surveys, and this may 
also impact our results. The error bars shown include 
only Poisson noise and therefore reflect only the statisti- 
cal uncertainties on our completeness measurements. 



halos. To measure this we cross-match the VTP clusters 
with dark matter halos, requiring a separation of equal to 
or less than each dark matter halo's r2oo- Clusters with 
the smallest deviations in both spatial and redshift space 
are flagged as the best candidates. Clusters with no over- 
lapping halos are taken as contamination. The number 
of these false clusters relative to the total number of de- 
tected clusters determines the contamination. Figure [7] 
shows the contamination fraction versus the redshift as- 
signed to the contaminating cluster. Clusters at all red- 
shift where a SDSS-like survey probes, i.e., z < 0.4, show 
a very steady contamination level at around the 40% 
level. Note that we do not examine the mass distribu- 
tion of the contaminating clusters, because our only mass 
estimate in the case of contaminating clusters comes from 
the measured optical mass indicator B gc , which is a poor 
indicator of cluster mass (see $6]) . 




Fig. 7. — We plot the contamination fraction c(z) of all VTP 
clusters found that do not match to any true halo in the simu- 
lations. At SDSS depths the VTP cluster catalog exhibits low 
contamination at z < 0.6 




14 

log, (M 200 ) 

Fig. 6. — A plot of the completeness s(M2qq, z) versus cluster 
mass and redshift for halos that are recovered by the VTP finder. 
Different redshift ranges are encoded using different colors. 



5.2. Contamination c(z) 

We define the contamination c(z) to be the fraction 
of detected VTP clusters not matched with dark matter 



5.3. Estimating Systematic Uncertainties on 
Completeness and Contamination 

The completeness and contamination measurements 
presented above determine the best estimate of the selec- 
tion function for the VTP finder, which can then be used 
in the cosmological interpretation of a VTP cluster cat- 
alog. An important element of any analysis is to include 
not only the best estimate of the selection function but 
also the associated uncertainties. These uncertainties de- 
rive from current observational uncertainties in physical 
properties of the cluster and field galaxy populations, 
and they are therefore systematic in nature- e.g., an en- 
hanced blue fraction at high redshift will make it system- 
atically more difficult to select clusters at those redshifts. 
We can use those observational uncertainties to charac- 
terize the uncertainties in the VTP selection function, 
and of course as the observational constraints on optical 
properties of the cluster and the field galaxy populations 
tighten we can improve our simulated catalogs and re- 
duce the systematic uncertainties on the selection func- 
tion of the cluster finder. We proceed by creating several 
more simulated catalogs with input parameters varied 
around the best measured values. Then we run the VTP 
finder on each of these modified catalogs to determine the 
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selection in each case. By comparing the selection from 
a modified catalog to that derived from the primary cat- 
alog one can estimate the effect of changing each simula- 
tion parameter on the performance of the finder. Below 
we illustrate how this can be approached by carrying out 
such an analysis on the VTP cluster finder. 
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Fig. 8. — The change in completeness due to changes in the blue 
fraction, which depends on the i^-band luminosity, the mass of 
the host halo, and the galaxy's distance from the host halo center, 
as well as redshift evolution factor in the form of (l+ z )5; tle ! where 
Iblue is -1-6. Results show the change as a function of mass for each 
redshift slice considered between the fiducial blue fraction model 
and one that is perturbed by la in its parametrization. The bottom 
panel shows the systematic uncertainty in the completeness due to 
the la observational uncertainty in this blue fraction. 

The residual completeness, defined as the completeness 
of a modified catalog where the completeness of the pri- 
mary run is subtracted, is shown in several figures: one 
for tests of the impact of blue fraction (Figure HJ), HON 
evolution (Figure IH]), intrinsic scatter in the red sequence 
(Figure [TU)) . and relative luminosity of the field and clus- 
ter populations (Figure If f I) . In each figure the bottom 
panel collects the results from each redshift, scales to the 
estimated la uncertainty on completeness due to this ef- 
fect, and presents no uncertainties to improve the read- 
ability. 

In Figure [5J a modified catalog is generated with blue 
fraction as a function of M 2 oo, r 200, and Mk 3 (see 
t j3.2j) and with redshift evolution included in the form 
of (l+z)j ;ue . The adopted "fbiue is -1.6 so that the blue 
fraction at redshift of 1 to be 80%. Since the blue frac- 
tion depends on several factors, it's not possible to quote 
one value for the blue fraction at certain redshift. The 
adopted jbiue is somewhat arbitrary in this presented 
exercise, since how blue fraction in clusters evolves with 
redshift is still controversial. The fact that there are less 
red galaxies in clusters at high redshift, the selection be- 
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Fig. 9. — The change in completeness due to changes in the evo- 
lution of the HON with 7=+l (diamond) or -1 (triangle). Each 
panel with different color coded symbols represents different red- 
shift bin of which the sequence is shown in the lowest panel. The 
lowest panel shows the systematic uncertainty in the completeness 
due to the la observational uncertainty in HON. 

comes not as good as in the primary catalog test. This 
is shown as dashed lines above the primary catalog com- 
pleteness shown at zero. Figure [9] shows the comparison 
in completeness for the HOD modification run where the 
evolutionary factor, ^hod = ±1 is implemented. With 
+1 shown with diamond, for example, there are more 
galaxies in a cluster of a given mass at higher redshift, 
and this makes it easier to detect these systems. The tri- 
angles show the other case with — 1 , where there are fewer 
cluster galaxies at higher redshift, and so the finder de- 
tects fewer systems. Figure [TU] shows two different levels 
of intrinsic scatter in the red-sequence: one with smaller 
(blue) - ~ 0.02 to 0.03 - and the other with larger scat- 
ter (red) - ^0.12-0.15 - than the fiducial scatter in the 
primary catalog. This test illustrates how the intrinsic 
scatter of the color-magnitude relation alters the perfor- 
mance of the cluster finder. The smaller the scatter is, 
the more clusters are recovered by the finder, for exam- 
ple, since the finder relies on the existence of the red- 
sequence to isolate cluster galaxies from the field galax- 
ies projected nearby. In Figure llll triangle (diamond) 
symbols represent the case where the of the field LF 
is brightened (dimmed) compared to that of the cluster 
LF. The completeness shown in the lower redshift bins is 
counter-intuitive, where in high-z the change is too small 
to see the effect. It is currently under investigation why 
the performance of a cluster finder gets better when the 
field galaxies get brighter, but a possible explanation is 
that brightened field galaxies at high z contribute to sig- 
nals of clusters at lower z. We also note that the changes 
are at a level of <1% which could be random fluctua- 
tion. Changing the field galaxies to make them brighter 
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or fainter than the cluster does not have a significant 
impact on the completeness. 
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Fig. 10. — The change in completeness due to changes in the 
intrinsic scatter of the cluster red-sequence Ajjg=0.02-0.03 (blue) 
and Ars =0.12-0. 15 (red). Again, the bottom panel corresponds 
to the la uncertainty in the completeness due to the uncertainties 
in the intrinsic scatter. 



As noted above, the bottom panel of Figures [SJ [HI UHl 
and [11] shows the systematic uncertainty in the com- 
pleteness due to observational uncertainties due to un- 
certainties in the blue fraction in clusters, the HON, the 
size of the intrinsic scatter in the red sequence, and the 
field/ cluster LF. The runs used in Figures [8j [9j ITUl and 
fTTI use characteristic values for the deviations from the 
best-fit values, which are large enough to see the effect 
of changes in the performance of the finder. We can 
then use these measured changes in completeness to es- 
timate the changes one would see for a la shift in each 
of the simulation parameters. Colors for redshift bins 
are the same as in Figure [6] and the others. Overall, it 
is clear that the existing observational uncertainties in 
the properties of cluster and field galaxy populations out 
to z ~ 0.8 do not translate into large systematic uncer- 
tainties in the selection function for the VTP finder. To 
fully characterize the uncertainties in the selection func- 
tion would require an analysis of the covariances among 
these effects. Here we are providing a demonstration of 
how one would proceed (Figure IT2l . In the limit of small 
covariances in the impact on the completeness of vari- 
ations among these key simulation parameters, one can 
simply take the quadrature sum of the contributions to 
the uncertainty from each different parameter. This leads 
directly to an estimate of the systematic uncertainty on 
the completeness as a function of mass and redshift that 
could be used in the cosmological analysis of any cluster 
sample extracted using the finder. In a similar manner 
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Fig. 11. — The change in completeness due to changes in rela- 
tionship between the field and cluster luminosity functions (LF). 
The shows the case where the field LF is made fainter, and the red 
is the case where the field LF is made bright. The bottom panel 
shows the la uncertainties in the completeness due to remaining 
uncertainties in the offset between the cluster and field LFs. 

the impact of mock catalog simulator parameters on the 
contamination could be carried out. 
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Fig. 12. — Combined uncertainty in completeness as a function 
of mass and redshift. Each la uncertainty from the different simu- 
lation parameter tests is combined in a quadrature sum assuming 
contributions from each parameter are independent. Current ob- 
servational uncertainties on the properties and evolution of cluster 
galaxy properties translate into typically ~ 10% systematic uncer- 
tainties in the survey selection function. 



6. PERFORMANCE OF THE B G c MASS ESTIMATOR 
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Fig. 13. — The B gc versus halo mass relation. Top: red symbols 
are clusters at redshift greater than 0.4. Bottom: The distribution 
in mass of clusters that lie within fixed ranges in B gc . 



Along with finding clusters one must be able to reliably 
estimate their masses to be able to use the evolution of 
the mass function with redshift to study the nature of 
the dark energy or cosmic acceleration. Establishing a 
reliable estimator in optical bands stands as a significant 
hurdle for optical cluster cosmology studies. This may 
well be because, unlike in the X-ray and SZE where the 
signature of the hot gas within the virial region is dra- 
matically enhanced by it's higher temperature and den- 
sity, the galaxies change more gradually as they move 
from the field to the cluster. In particular, red sequence 
galaxies exist not only in clusters, but also in groups and 
more generally as tracers of the large scale structure. Be- 
cause the red sequence galaxy population comes not only 
from the cluster virial region but also from the surround- 
ing cluster environment, any signature derived from this 
population (i.e. the number of red galaxies or the inte- 
grated luminosity from this population) seems to be less 
well correlated with cluster virial mass than measured 
extracted from the X-ray and SZE. Here we use our sim- 
ulated catalogs to explore one well known optical mass 
estimator. 

One measure of the strength or richness of the clus- 
ter galaxy population is using B gc , which is the ampli- 
tude in the cluster center-gal axy correlation. The B„r 
parameter, first pioneered by lLongair fe Seldnerl (11979 
and later extensively tested bv lYee fc L6pez-Cruzl (199S 
is defined as follows; 



B„ 



D 7_3 4 



Hkg 



I 7 $[Af(m ,z)] : 



(6) 



A„ 



Nnet r (3 - 7)i^ 



N, 



bkg 



(7) 



Here N^g is the background galaxy count down to an ap- 
parent magnitude mo and $[M(mo, z)\ is the integrated 
LF of galaxies up to the absolute magnitude M corre- 
sponding to mo at the cluster redshift z. J 7 is a constant 
that depends on the choice of 7, and D is the angu- 
lar diameter distance to z. As shown in Equation [5J 
B gc measurements depend on the LF that is adopted 
and th e background number counts. lYee fc Lopez-Cruzl 
( 1999) tested the sensitivity of B gc values to these two 
parameters and to the magnitude limit down to which 
the adopted LF was integrated and concluded that they 
did not strongly affect B gc values as long as the normal- 
ization of the LF was carefully measured. 

In Figure Q2] we plot the richness of the systems in 
the cluster catalog extracted using the VTP algorithm 
versus the real mass of the halos in the simulation. As 
seen in our scatter plot of the B gc vs. M 2 oo for 8000 
halos in the mock catalog (Figure Q2]), the scatter in 
this relation is quite large and the apparent correlation 
is quite weak. Red symbols in the plot, which repre- 
sent clusters at redshift higher than 0.4, show higher 
redshift clusters tend to have lower B gc values for their 
mass. As shown in Equation the measurement of 
B gc depends on the net count of galaxies through A gc 
{N net = N c i us ter members - N bkg ) which decreases with 
redshift due to the flux-limited nature of the mock cat- 
alog. This results in the preference of a lower B gc for 
higher redshift systems. We also note that the B gc mea- 
surement by the VTP finder is restricted within the red- 
sequence slice for each cluster, which results in "quanti- 
zation" in B gc values in the Figure [TBI upper panel. 




-0.2 0.0 0.2 
A log 10 (M 200 /M sun ) 



0.6 



Fig. 14. — The cumulative distribution in mass of different cuts 
in Bg C shifted so that they overlap at the 50 percentile point. Note 
that the different B gc cuts correspond to cuts in mass that are well 
described by a Gaussian with g io g1 n of 0.25 (red line). The color 
scheme is the same as in Figure 1131 



IRvkoff et al.l (|2008T ) report a best-fit relation between 
X-ray luminosity and their mass indicator (-/V200), us- 
ing the maxBCG clu sters at the redshift from 0.1 to 0.3 
(|Koester et al.l [2007T ). -/V200J their richness indicator, is 
given by the number of E/S0 ridgeline members falling 
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within i?200 of the BCG and brighter than 0.4L*. The 
best fit relation in their findings shows an intrinsic scatter 
oini = 0.86 ±0.03, which corresponds to oi O g 10 l ~ 0.37. 
That, in turn, corresponds to oiog 10 m ~ 0.25, assum- 
ing the scaling relation between X-ray luminosity and 
mass is a power-law of the form logioi ~ log 10 M 15 
(jReiprich fc B5hringerl 120021 ). Figure ITU shows the cu- 
mulative distribution of B gc scatter in each B gc bin, and 
the cumulative Gaussian distribution with a of 0.25 (red 
dotted line). Except for the highest B gc bin, the cumula- 
tive distribution is nearly Gaussian with a of 0.25. These 
data support the use of Gaussian scatter in log 10 (M) in 
the o ptical richness param eter for cosmological studies 
(e.g.. iGladders et "ail 120071 ). Interestingly, our measure- 
ment of the intrinsic scatter in the B gc mass relation is 
consistent with the intrinsic scatter estimated for A^oo 
that is extracte d by leveraging th e X-ray properties of 
galaxy clusters (jRvkoff et al.ll2008l) . 
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Fig. 15. — The red sequence overdensity toward Abell 1682 
(z = 0.226) within 0.8Mpc from the X-ray center. Top: The num- 
ber of galaxies within the area of 0.8Mpc radius from the center at 
each redshift and color which is corrected for an estimated num- 
ber of background galaxies within the same aperture size at each 
redshift and color. The solid black line is the fitted line assuming 
the overdensity distribution around the peak is Gaussian. Bottom: 
The likelihood for an overdensity to be a cluster signal compared to 
background noise is shown, which is then turned into the probabil- 
ity of being real cluster detection, assuming the background noise 
is Poissonian. 



7. A RED-SEQUENCE REDSHIFT ESTIMATOR 

In this section, we present tests of a redshift esti- 
mator that has been used in studies of SZE and X- 
ray selected gal axy clusters ()Staniszewski et all 120081 : 
High et all 120101 Suhada et al 2011, Williamson et al 
2011) and that forms the core of an optical cluster selec- 
tion method being developed for joint optical and SZE 



cluster finding (Liu et al 2011). In the case of an SZE 
or X-ray candidate, we examine galaxies within a region 
of the sky around the known center by choosing a phys- 
ical radius of r ~ 0.8 Mpc or an estimated virial radius 
depending on the application. At each redshift we 
select only the galaxies with colors appropriate for the 
red-sequence, using the SSP model (BC model which is 
the same model that we use to paint red galaxies in the 
mock catalog described in ^3.31) . We apply a correction 
for the expected number of background galaxies with this 
color at each redshift and thereby end up with a mea- 
sured overdensity of red-sequence galaxies as a function 
of redshift along the line of sight toward the X-ray or 
SZE cluster candidate. All four available filters, g-, r-, 
i-, z-, are used to look for this red galaxy overdensity at 
all redshift range in order to avoid false peaks due to de- 
generacy between colors and redshift. The degeneracies 
in colors of g-r, r-i and i-z, has been known to generate 
false peaks at transitional redshift ranges (i.e., ~0.35, 
~0.75 for z< 1) where the 4000A break moves out of one 
of the filters in colors. By scanning through the data with 
five colors, g-r, g-i, r-i, r-z and i-z, at certain redshift 
intervals, one can reinforce the real overdensity peaks to 
show up in any of these color combinations (sometimes 
in more than one colors), avoiding using g-r at redshift 
around 0.3-0.35, for example. 
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Fig. 16. — The background corrected rcd-scqucncc gala xy o ver- 
density for MS1621. 5+2640 (z = 0.426), same as in Figure [151 



We have tested this approach using 51 X-ray and op- 
tically selected clusters that lie within the SDSS DR7 
survey region. For each cluster we estimate the clus- 
ter mass and virial radius using an X-ray tempera- 
ture and the mass-tem perature scaling relation from 
(|Finoguenov et all 120011) . Figure [15] shows the results 
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for Abell 1682 and Figure [TBI shows the results from the 
rcdshift estimation of MS1621. 5+2640. On the top panel 
of the histograms, the background corrected net number 
within the cluster's r2oo is plotted in each redshift bin. 
Abell 1682 is a cluster with X-ray temperature of 7.24 
keV at z = 0.226 and MS1621.5+2640 has X-ray tem- 
perature of 7.6 keV at z = 0.426. Because the 4000 A 
break in the old stellar populations of the cluster galaxies 
moves out of g-band at about z = 0.35, the peak in the 
histogram for MS1621. 5+2640 shows up in r-z vs. z at 
the appropriate redshift, while Abell 1682 shows a peak 
in g-r vs. r histogram at the appropriate redshift. The 
bottom panel in each figure shows the likelihood of each 
detection where we assume Poisson noise. The redshift 
bin with the maximum likelihood is present is chosen as 
the initial estimation of a cluster (indicated as red asterik 
on the top panel), and we refine the redshift estimation 
using a Gaussian function fit to the overdensity distribu- 
tion around the initial peak. 

In Figure [T7] on the top panel, we show a plot of the 
photometric redshift versus the spectroscopic redshift for 
the full ensemble of 51 clusters, while the bottom panel 
shows the same test using a simulated galaxy clusters 
described in this paper. There is good overall agree- 
ment, with evidence that our estimates systematically 
higher by with the root mean square (rms) scatter of the 
photo-z's around the true redshifts (black dotted line) 
of 2.9% and an rms of 1.8% once the bias is removed. 
This systematic bias in redshift could be further reduced 
by additional tuning of the SSP models used to predict 
the red-sequence color and evolution, as supported by the 
test using the simulated galaxy cluster populations using 
the same SSP models for galaxy colors and the redshift 
estimator. 

8. DISCUSSION 

We have presented an empirical method for construct- 
ing simulated catalogs that relies upon high resolution 
dark matter-only simulations and the observationally 
constrained properties of cluster and field galaxy pop- 
ulations. This empirical approach is attractive because 
it offers the possibility to test, improve and character- 
ize the final performance of optical cluster finders and 
other tools that are used on real galaxy catalogs. This 
method can be further tuned as improved observational 
constraints on cluster and field galaxy populations be- 
come available. 

In fJ5]we have demonstrated the power of this approach 
by characterizing the selection of the VTP cluster finder. 
We have used the mock catalog to measure the contami- 
nation as a function of redshift and the completeness as a 
function of mass and redshift. The development version 
of this code performs reasonably well, with characteristic 
contamination of ^40% out to z ~ 0.55, and complete- 
ness that increases with mass and reaches characteristic 
values of around 50%. 

An advantage of our method is the ease of modifying 
the galaxy populations by altering the population param- 
eters such as the blue fraction mass dependence and red- 
shift evolution, the intrinsic scatter in the red sequence, 
the evolution of the HON and the relative brightness of 
field and cluster galaxy populations. We have used this 
property of the catalog simulator to extract the system- 
atic uncertainties on the selection function using the full 




0.2 0.4 0.6 

Spectroscopic z 

Fig. 17. — Comparison between photometric and spectroscopic 
redshifts. Top: The comparison using an ensemble of galaxy clus- 
ters that overlap the SDSS DR7 data release. The rms in blue 
shows the scatter from the best-fit line which is drawn in blue and 
includes a systematic overestimate shown in black in the photo-z's 
with the scatter of 0.029. Bottom: A comparable test is shown 
based on a galaxy catalog created by the galaxy simulator pre- 
sented in this work at a similar survey depth of SDSS. 

range of the catalog parameters that are consistent with 
current observational data. Sensitivity of the complete- 
ness to variations in these parameters is at the ~5% level 
for blue fraction, HON and red sequence scatter changes, 
but it does approach ~15% at certain redshifts. Uncer- 
tainties are only at the ~1% level due to relative bright- 
ness changes in the field and cluster LFs. We also have 
demonstrated how one would combine effects of uncer- 
tainties in different parameters on estimating selection 
functions of a cluster finder, assuming the effects of pa- 
rameters explored in this work are independent. When 
they are all combined in quadrature, the uncertainty is 
at the level of <15%. These uncertainties should be in- 
cluded in the cosmological analysis of optically selected 
cluster samples. With additional work on the optical 
properties of cluster galaxy populations, especially in the 
high redshift regime, these uncertainties can be reduced 
to enable the full statistical power of the large optically 
selected cluster samples to be realized. 

In addition, we used the ensemble of catalogs to test 
optical mass estimation (SJBJ) and redshift estimation (SJ7J). 
Our analysis shows that the B gc optical mass estimator is 
correlated with cluster halo mass but with large scatter. 
The scatter in mass at fixed B gc is approximately log nor- 
mal and about 75% (cr log (m) ~ 0.25), which is markedly 
worse performance than X-ray and SZE mass estimators 
(IMohr et alJ [19991 IQ'Hara et all l2?M IVikhlinin et al.l 
120091 : iVanderlinde et all 120101 : lAndersson et al.l 12010). 
The performance of the red sequence overdensity redshift 
estimator is better than 2% once biases possibly associ- 
ated with a mismatch between the evolution of observed 
and modeled red sequences are taken into account. 
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This project highlights the importance of empirical 
mock catalogs, not only for obtaining an accurate esti- 
mate of the selection function for a cluster finding algo- 
rithm, but also for characterizing the uncertainties in the 
selection function. Moreover, a catalog generator is an 
essential tool during the development of tools for optical 
cluster finding, mass estimation and photometric rcdshift 
measurement. The next step in this development stream 
is to extend to larger volume light cone outputs from 
other structure formation simulations and to extend the 



analyses of galaxy populations to higher redshift using 
deeper survey datasets such as those coming from the 
Dark Energy Survey. 
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